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1 HISTORICAL INTRODUCTION 

In this note we present an elementary inductive proof which Euler could have 
obtained, for his assertion that every prime of the form 20n + 1 or 20n + 9 is 
a sum + 5y^, had he refined a bit his proof for Format's theorem that every 
prime of the form 4n + 1 is a sum of two squares. 

Here and throughout this note all letters are assumed to stand for nonneg- 
ative integers, unless otherwise specified. It is our pleasure to start by briefly 
reviewing the story told by Cox in the nice book [2] . 

Pierre de Format (1601-1665), who had done pioneering work on representing 
primes as + ny^, stated, but did not write down a proof, that he had proved 
by his favorite method of infinite descent the following: 

(i) Every prime p = 1 mod 4 is a sum x'^ + y^. 

(ii) Every prime p = 1, 3 mod 8 is a sum + 2y^. 

(iii) Every prime p = 1 mod 3 is a sum x'^ + 3j/^ . 
He also conjectured but could not prove that 

(iv) The product of two primes, each of which is = 3, 7 mod 20, is a sum 
x"^ + 5y^. 

Lconhard Euler (1707-1783) heard of Format's results and spent 40 years 
proving (i)-(iii) and considering their generalizations. This finally led him to 
the discovery of the quadratic reciprocity, although he could not provide a solid 
proof for it. By working out numerous examples on representing primes as 
x"^ + ny^ for various n, he discovered more patterns. Some of his discoveries 
which he could not prove are: 

(v) Every prime p = 1, 9 mod 20 is a sum x'^ + by"^. 

(vi) For every prime p = 3,7 mod 20, 2p is a sum + 5y^. 

(vii) A prime p = x"^ + 27y^ if and only if p = 1 mod 3 and 2 is a cubic 
residue modulo p. 

(viii) A prime p = x'^ + 6Ay^ if and only if p = 1 mod 4 and 2 is a biquadratic 
residue modulo p. 



Joseph-Louis Lagrange (1736-1813) and Adrien-Marie Legendre (1752-1833) 
later developed the form theory as well as the genus theory to prove (iv)-(vi). 
Indeed, they could prove (v) and that 

(v') Every prime p = 3,7 mod 20 is a sum 2x^ + 2xy + 3y^ (where one of 
X, y may be negative). 

Then (iv) and (vi) follow immediately from the following two identities: 

(2x^ + 2xy + 3y^)(2a^ + 2ab + 36^) = {2ax + bx + ay + 3byf + 5{bx - ayf; 
2{2x^ + 2xy + 3y^) ^ {2x + yf + Sy^ 

Both Legendre and Lagrange, however, could prove neither (vii) nor (viii). 

It was Carl Friedrich Gauss (1777-1855) who had finally tackled (vii) as well 
as (viii) using his cubic and biquadratic reciprocities. And, years before this, 
it was also Gauss who gave the first rigorous proof of the quadratic reciprocity. 
The interested reader is referred to [5] to enjoy the rest of the story. 

What we shall show is that in fact Euler could have proved (iv)-(vi) had 
he just refined his proof of (i)~(iii); and hence the above told story would be 
somewhat different. 

2 A REVIEW OF EULER'S PROOF 

Let us first briefly review Euler's proof. According to [1], a version of Euler's 
proof of (i) goes as follows. For prime p = 1 mod 4, there is an r < p such 
that pr = x^ + 1. For each prime factor q of r, since —1 is a quadratic residue 
modulo q, it follows that either q = 1 mod 4 or g = 2. We assume by induction 
that each such g is a sum of two squares. Then a cancelation lemma. Lemma 
[1] below, enables to cancel the prime factors of r one by one. As a result, one 
obtains a representation of p as a sum of two squares. 

By a (1, n)-representation we mean an expression of the from + ny^. The 
following lemma appears as Lemma 1.4 in [5]. Here, for the convenience of the 
reader, we include the proof given in [5] in a slightly shortened form. 

Lemma 1 Suppose p and pr each has a {l,n)-representation, where p is a 
prime. Then r has a {l,n) -representation. 

Proof. Suppose p = + nb^, and pr = x'^ + ny^. Then 

p'^r ~ {ax ± nbyY' + n{ay =p bx^ . 

Note that p \ {ay — bx){ay + bx) since 

{ay — bx){ay + 6a;) = (a^ + nb'^)y'^ — b'^{x'^ + ny"^). 

It follows that either p \ ay — bx or p \ ay + bx; correspondingly, p \ ax + nby or 
p \ ax — nby. Consequently, we have one of the following holds: 

r = {{ax + nby)/pY + n{{ay — bx)/p)'^, 
r = {{ax — nby)/pY + n{{ay + bx)/pY . 
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This proves Lemma [T] □ 

It is not hard to see that Euler's proof also applies to the cases of + 2y^ 
and + 3?/^ after minor modifications. This is because in the representation 
pr = + 2 (resp. pr = x"^ + 3) all of the prime factors of r (with one or two 
exceptions which are easy to deal with) are of the desired type, so we can again 
use Lemma [1] and the inductive hypothesis to cancel them one by one. 

However, it is not the case for x"^ + by^ . Note that for pr = x^ + 5, where 
we may assume that 5 | r, each prime factor q of r is such that either q = 
1,9, 3, 7 mod 20 or g = 2, hence Lemma [T] is not enough to cancel all prime 
factors q of r; we have to deal with those g = 3, 7 mod 20 and q = 2. Indeed 
we do have such a cancelation lemma (Lemma [2] in ^ which enables us, with 
the help of a small trick, to cancel such factors pair by pair under the inductive 
hypothesis that (iv)-(vi) hold for all primes q < p such that q= 1, 9, 3, 7 mod 20 
or g = 2. It turns out that we must prove (iv)-(vi) simultaneously by induction. 
The rest of this note consists of the detailed statements and proofs. 

3 A SECOND CANCELATION LEMMA 

For the convenience of further exposition, we make the following definition. 

Definition. A (1, n)-representation is said to be nontrivial if both of x and y 
arc nonzero; it is proper if x and y are relatively prime. 

Remarks. The following items (a)-(f) can be easily checked. 

(a) A proper (1, ?i)-representation is automatically nontrivial unless it equals 
1 or 71. 

(b) A (1, n)-representation of a primep, wherep ^ n, is automatically proper 
and nontrivial. 

(c) A nontrivial (1, 7i)-representation of the product of two primes is always 
proper. 

(d) If p is a prime such that p \ r and p \ n, then any (1, n)-representation 
of pr is nontrivial. 

(e) There is the following very useful Euler identity which expresses the 
product of two (1, n)-rcprcscntations as a (1, n)-representation in two ways: 

(a^ + nb'^){x^ + ny^) = {ax ± nhyY + n{ay =F hx)"^ . (1) 

(f) If an odd s, where 5 j" s, has a nontrivial, proper (1, 5)-representation 
8 = 0^ + 56^, then 

s^ = (a^ - bh^f + h{2ahf 

is a nontrivial, proper (1, 5)-represcntation, since every prime common factor of 
a? — 56^ and 2ah is a common factor of a and h. 
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Our second cancelation lemma, Lemma below, enables us to cancel , where 
q is an odd prime and q^ has a nontrivial (1, n)-representation, from a given 
(1, n)-representation of q^r and obtain a (1, n)-representation of r. 

Lemma 2 Suppose q^ has a nontrivial {1, n)-representation, where q is an odd 
prime. If q^r has a {l,n)-representation, then r has a {l,n) -representation. 
Moreover, if q^r has a nontrivial, proper (l,n) -representation and r ^ l,rt, 
then r has a nontrivial, proper {l,n) -representation. 

Proof. Suppose = + nb'^ is nontrivial. Then it is proper by Remark (c). 
Let q^r — + ny^. From the Euler identity ((T|), we have 

q'^r = (ax ± nby)^ + n{ay ^ bx)'^ . 

Then q^ \ {ay — bx){ay + bx) as in the proof of Lemma [TJ 

First, suppose q^ \ ay — bx and f ay + bx. Then wc must have q \ ay — bx 
and q \ ay + bx. Hence q \ 2ay. Since q is odd, q \ ay. We then have q \ y since 
q \ a (otherwise a = 0, or a = q and & = 0, a contradiction). Similarly, q \ x. 
Consequently, we have 

r = {x/qf +n{y/qf. 

In this case q^r = x"^ + ny"^ is not proper. 

Now we may suppose q^ \ ay — bx or q^ \ ay -\- bx. Then q^ \ ax -\- nby 
or q^ \ ax — nby accordingly. Consequently, we have one of the following two 
holds: 

r = {{ax + nby)/q^f + n{{ay - bx)/q^f, (2) 
r = {{ax - nby)/q^f + n{{ay + bx) / q^f . (3) 

Claim L The above obtained (1, n)-representation of r is nontrivial and proper 
if so is q^r = x"^ + ny^ . 

Proof of Claim L In the case where q^ \ay — bx and q^ \ ax-\- nby, it follows 
from the identities 

X = a{{ax + nby)/q^) — nb {{ay — bx)/q^), 
y^b {{ax + nby)/q^) + a{{ay - bx)/q^) 

that {ax + nby)/q^ and {ay — bx)/q^ are relatively prime since so are x and y. 
Hence the (1, n)-representation ^ is proper. 

In the case where q^ \ ay-\-bx and q^ \ ax — nby, it follows from the identities 

X = a{{ax — nby)/q^) + nb {{ay + bx)/q^), 
y = -~b{{ax - nby)/q^) + a{{ay + bx)/q^) 

that {ax — nby)/q^ and {ay + bx)/q^ are relatively prime since so are x and y. 
Hence the (1, n)-representation ([3]) is proper. 

Since r ^ in either case the (1, n)-representation of r is nontrivial by 
Remark (a). Claim I is thus proved. 
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This completes the proof of Lemma [5] 



□ 



The case where q = 2 and n = 5 is simple and is considered in the following 

Addendum to Lemma [2l If 2^r has a (1, 5)-representation 2^r ~ x'^ + 5y^, 
then X and y must be both even (by a simple modulo 4 argument), and hence 
r = (x/2)^+5(2//2)^. Furthermore, iix'^+by'^ is nontrivial, so is (x/2)^+5(?//2)^. 

4 THE PROOF THAT EULER MISSED 

For convenience of later reference, we restate the assertions (iv)-(vi) in iJT]as 

Theorem. (1) Every prime p= 1,9 mod 20 has a {l,5)-representation. 

(2) For every pair of primes q, q' such that g = 3, 7 mod 20 and either q' = 
3, 7 mod 20 or q' ~ 2, their product qq' has a nontrivial (1, 5) -representation. 

It is the following inductive proof that Euler missed. 

Proof. Suppose by induction that (1) and (2) hold for all primes p,q,q' which 
are less than a certain prime tt where tt = 1, 9, 3, 7 mod 20. We need to show 
that 

(1) ^ If TT = 1, 9 mod 20 then tt has a (1, 5)-representation. 

(2) ^ If TT = 3,7 mod 20 then for every prime q < tt such that g = 3, 7 
mod 20 or q = 2, nq has a nontrivial (hence proper) (1, 5)-representation. 

To start, we have from the quadratic reciprocity that for a prime p ^ 2,5, 

—5 is a quadratic residue mod p p = 1, 9, 3, 7 mod 20. 

Hence there is a (1, 5)-representation 

7rr = a;^ + 5y^. (4) 

Here (U) initially holds for some x < (tt — l)/2 and y = 1; it follows that 
x^ + 5y^ < TT^ and hence r < tt. After reduction if necessary, it can be assumed 
that 5f r and that is a nontrivial, proper (1, 5)-representation. 

Claim II. When r in ([5]) is minimized, we have either r ~ I or r = q' , where 
q' is a prime such that cither q' = 3,7 mod 20 or g' = 2. 

Proof of Claim II. Since —5 is a quadratic residue mod p, for each prime 
factor q of r, we have q < n and either q = 1, 9, 3, 7 mod 20 or q = 2. Our idea 
is to manage to cancel the prime factors of r one by one for those congruent to 
1, 9 modulo 20, and pair by pair for those congruent to 3, 7 modulo 20 or equal 
to 2. 

If r has a prime factor q such that q = 1,9 mod 20, then, by the inductive 
hypothesis, q has a (1, 5)-representation. By Lemma[Tl nr' , where r' = r/q, has 
a (1, 5)-representation. Hence, by minimizing r in we may assume that r 
has no prime factors congruent to 1,9 modulo 20. 

Now each prime factor g of r is of the form either q = 3,7 mod 20 or q = 2. 
If the number of such prime factors of r, counted with multiplicity, is at least 
2, let q, q' be two of them and set r' = r/ {qq'). 
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If g = g' then qq' has a nontrivial (1, 5)-representation by the inductive 
hypothesis, hence we can apply Lemma [2] and its addendum directly to cancel 

from ttt = {Trr'){q'^) and obtain a (1, 5)-representation of nr' . 

li q =/= q' then qq' has a nontrivial (1, 5)-representation by the inductive 
hypothesis again. Now q^{q')'^Trr' = [qq'){-Kr) has a (1, 5)-representation by the 
Euler identity ([1]), and applying Lemma [2] and its addendum twice implies that 
7rr' has a (1, 5)-reprcscntation — here is the trick used. This finishes the proof of 
Claim IL 

We proceed to prove the inductive step. By minimizing r in (|4]), we are in 
one of the alternatives described in Claim IL 

First, we prove (l)7r- In this case tt = 1,9 mod 20. One must have r = 1 
and hence tt has a (1, 5)-rcpresentation; otherwise, r = = 3,7 mod 20 or 
r = = 2, but then nq' = + = 3, 7, ±2 mod 20 and consequently 

= ±2 mod 5, a contradiction. This proves (l)7r- 

To prove (2)^, suppose tt = 3, 7 mod 20. One must have r ~ q' as described 
in Claim II; otherwise r ~ 1, which implies that irr = + = 3, 7 mod 20, 
a contradiction. Thus irq' has a (1, 5)-reprcsentation, which is automatically 
nontrivial and proper, for some prime q' < n such that cither g' = 3, 7 mod 20 
or q' = 2. Then Remark (f) implies that 7r'^{q')'^ ~ {'Kq'){Tiq') has a nontrivial, 
proper (1, 5)-representation. By the inductive hypothesis, cither q' = 2 or (g')^ 
has a nontrivial (1, 5)-representation. Lemma [2] and its addendum then give a 
nontrivial, proper (1, 5)-representation of tt^. 

To prove the remaining part of (2)^, let g < tt be any prime such that 
either g = 3, 7 mod 20 or g = 2. Then either 5 = q' = 2 or, by the inductive 
hypothesis, qq' has a nontrivial (1, 5)-representation. Thus ■Kq{q'Y ~ [T^<i')[(l' q) 
has a (1, 5)-representation by the Euler identity ([1]). On the other hand, by the 
inductive hypothesis, either g' = 2 or {q'Y has a nontrivial (1, 5)-representation. 
Now Lemma [2] and its addendum give a (1, 5)-representation for irq, which is 
automatically nontrivial. This proves (2)^. 

The theorem is thus proved by induction. □ 

Note that we have proved (iv)-(vi) without reference to (v'). More interesting is 
that in fact (v') follows from (vi). To see this, for any prime p = 3, 7 mod 20, let 
2p= + 5y^. It follows that both x and y are odd. Hence x = 2x' + y, where 
x' may be negative. Then p = 2x'^ + 2x'y + 3y^ gives a desired representation. 

Among many other existing elementary proofs of Fermat's theorem (i), we 
cannot help but mention Zagier's beautiful "one-sentence proof" (see [3], or as 
explained in [1]) to conclude this note. 
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